PATENT ABSTRACTS OF JAPAN 



(11) Publication number : 2001-005489 
(43)Date of publication of application : 12.01.2001 



(51) Int. CI. G10L 15/22 

G10L 15/00 

G10L 15/28 

H04N 5/00 

H04Q 9/00 



(21) Application number 


: 2000- (71) Applicant 


: SONY INTERNATL EUR0P 




117217 


GMBH 


(22)Date of filing : 


13. 04. 2000 (72) Inventor : 


RAPP STEFAN 






SILKE G0R0NJII 






RALF K0NPE 






PETER BUCHNER 






GIRON FRANCK 






HELMUT LUCKE 


(30) Priority 






Priority 99 99107201 Priority 13. 04. 1999 Priority EP 


number 


date : 


country : 



(54) CONTROL METHOD OF NETWORK 

(57) Abstract: 

PROBLEM TO BE SOLVED: To speedingly and flexibly control network devices 
by converting received and recognized voice commands into corresponding 
user network commands based on all pairs of user network commands 
included in user com mand interpretation elements and general languages. 
SOLUTION: Vocabulary elements of first and second devices and 
corresponding commands LI and L2 do not include same vocabulary 
elements. ThereforeLl cap L2={} and a received language Lwhich is merged 
for interface descriptionbecomes L=L1 cup L2. In other wordsa general 
document in a voice device 1 is constructed by combining pairs of 
commands corresponding to the vocabulary elements obtained from the 
device documentwhich is described by the vocabulary elements of the 



first device and the command Lland the device document which is 
described by the vocabulary elements of the second device and the 
command L2. In other wordsuser' s uttered commands are converted into 
user network commands based on all pairs of user network commands 
included in user command interpretation elements and general languages. 



CLAIMS 



[Claim (s)] 

[Claim 1] Change into a user network command characterized by comprising 
the following related in a user command in an audio stationand by the 
above-mentioned related user network command. A control method of 
network equipment which controls network equipment connected to the 
above-mentioned network via a network. 

A receiving step which receives at least one apparatus document 
including a language which corresponds to the above-mentioned network 
equipment and consists of at least one pair of a user network command 
relevant to the above-mentioned user command interpretation element. 
An adaptation step which carries out adaptation of the apparatus 
document received [ above-mentioned ] to documents in general which 
consist of a language of this apparatus documentand a language of the 
above-mentioned audio station constituted similarly. 
A converting step which changes into a corresponding user network 
command a user' s voice commanding which was above-received and has been 
recognized based on all the pairs of the above-mentioned user command 
interpretation element and a user network command included in the above- 
mentioned general purpose language. 

[Claim 2]A step which the above-mentioned adaptation step judges for 
whether a language of the above-mentioned audio station and a language 
of a newly received apparatus document include at least one same user 
command interpretation elementWhen same user command interpretation 
element does not existupdate a language of the above-mentioned audio 
station and All the pairs of a user command interpretation element of a 
language of the above-mentioned audio stationand a related user network 
commandA step combined so that all all the pairs of a user command 
interpretation element of a language of a newly [ the above ] received 
apparatus document and a related user network command may be 
containedAll the pairs of a user command interpretation element of a 
language of the above-mentioned audio station which updates a language 



of the above-mentioned audio station and is not common when at least one 
same user command interpretation element existsand a related user 
network commandjoin together and all the pairs of a user command 
interpretation element of a language of a newly [ the above ] received 
apparatus documentand a related user network command All the pairs of 
same user command interpretation element of a language of the above- 
mentioned audio station and a related user network commandthe above — a 
control method of the network equipment according to claim 1 of having a 
step which gives an identifier which defines apparatus related to all 
the each of a pair of of a user command interpretation element of a 
language of a newly received apparatus documentand a related user 
network command. 

[Claim 3]A control method of the network equipment according to claim 
2wherein the above-mentioned identifier is a name of apparatus placed in 
frontor postposed and given to a user command interpretation element of 
each set of a user network command relevant to the above-mentioned user 
command interpretation element. 

[Claim 4]A11 the pairs of a user command interpretation element of a 
language of the above-mentioned audio stationand a related user network 
commanda step which combines all the pairs of a user command 
interpretation element of a language of a newly [ the above ] received 
apparatus documentand a related user network command and which carries 
out and updates a language of the above-mentioned audio station[ have 
and ] each user command interpretation element — a language of the 
above-mentioned audio stationand the above — a pair of user commands of 
all the of a user command interpretation element included in an 
intersection with a language of a newly received apparatus documentand a 
related user network commandA control method of the network equipment 
according to claim 1 changing into a user network command corresponding 
based on all the pairs and selection processes of a user command 
interpretation element included in the above-mentioned general-purpose 
documentand a related user network command. 

[Claim 5]A control method of the network equipment according to claim 4 
characterized by comprising the following. 

A step which transmits an inquiry about which apparatus is used for the 

above-mentioned selection process to a user. 

A step which receives and recognizes a reply from a user. 

A step which chooses a corresponding user network command based on all 

the pairs of a related user network command included in a reply and a 

user command interpretation elementand the above-mentioned documents in 

general from a user who has recognized. 



[Claim 6] A control method of the network equipment according to claim 4 
or 5 characterized by comprising the following. 

A step to which the above-mentioned selection process transmits inquiry 
which apparatus was used immediately before to the above-mentioned 
network. 

A step which receives and recognizes a reply from the above-mentioned 
network. 

A step which chooses a corresponding user network command based on all 
the pairs of a user network command included in a reply and a user 
command interpretation elementand the above-mentioned documents in 
general from a network which received [ above-mentioned ]. 

[Claim 7] A control method of network equipment of claim 4 thru/or 6 
given in any 1 paragraph characterized by comprising the following. 
A step which the above-mentioned selection process receives the above- 
mentioned user' s utteranceand is recognized. 

A step which chooses a corresponding user network command based on all 
the pairs of utterance of a user who received and a user command 
interpretation elementand a user network command included in the above- 
mentioned documents in general. 

[Claim 8]A control method of network equipment of claim 4 thru/or 7 
given in any 1 paragraph characterized by comprising the following. 
A step which transmits inquiry whether the above-mentioned selection 
process has a high possibility that which apparatus will be most used to 
the above-mentioned network. 

A step which receives a reply from the above-mentioned network. 
A step which chooses a corresponding user network command based on all 
the pairs of a related user network command included in a reply and a 
user command interpretation elementand the above-mentioned documents in 
general from the received above-mentioned network. 

[Claim 9]A control method of the network equipment according to claim 8 
that high network equipment of a possibility that ****** will also be 
used is chosen from network equipment in which a power supply is 
switched on. 

[Claim 10] A control method of network equipment of claim 1 thru/or 9 
characterized by transmitting an apparatus document corresponding to 
this network equipment to the above-mentioned audio station directly 
from predetermined apparatus after network equipment is connected to a 



network given in any 1 paragraph. 

[Claim 11] A control method of network equipment of claim 1 thru/or 10 
characterized by transmitting an apparatus document to the above- 
mentioned audio station from apparatus predetermined [ this ] after the 
above-mentioned audio station transmits a user network command to 
predetermined apparatus given in any 1 paragraph. 
[Claim 12]. [ whether network equipment corresponding to the above- 
mentioned apparatus document is connected to the above-mentioned 
networkand ] Or a control method of network equipment of claim 1 thru/or 
11 which will be characterized by transmitting this apparatus document 
to the above-mentioned audio station from this apparatus document 
providing device if a user network command which requires an apparatus 
document with an apparatus document providing device specific from the 
above-mentioned audio station is received given in any 1 paragraph. 
[Claim 13]A control method of network equipment of claim 1 thru/or 
12wherein a general-purpose document stored in the above-mentioned audio 
station is empty in first stage given in any 1 paragraph. 
[Claim 14]A control method of network equipment of claim 1 thru/or 
12wherein a general-purpose document stored in the above-mentioned audio 
station includes a pair of basic set of a user command interpretation 
element and a related user network command in first stage given in any 1 
paragraph. 

[Claim 15]A control method of the network equipment according to claim 
14wherein the above-mentioned user command interpretation element and a 
pair of corresponding basic set of a user network command define initial 
grammar about an expression of utterance. 

[Claim 16]A control method of network equipment of claim 1 thru/or 
15wherein the above-mentioned user command interpretation element 
contains a lexical element given in any 1 paragraph. 
[Claim 17] A control method of network equipment of claim 1 thru/or 
16wherein the above-mentioned user command interpretation element 
includes a definition of grammar about continuous utterance of a user 
who may appear based on a keyword and/or a category given in any 1 
paragraph. 

[Claim 18]A control method of network equipment of claim 1 thru/or 
17wherein the above-mentioned user command interpretation element 
includes a definition about pronunciation given in any 1 paragraph. 
[Claim 19] A control method of the network equipment according to claim 
18wherein a definition about the above-mentioned pronunciation is 
related with a language of arbitrary apparatus. 

[Claim 20]A control method of the network equipment according to claim 



18wherein a definition about the above-mentioned pronunciation is 

related with a language of two or more apparatus. 

[Claim 21] A control method of network equipment of claim 1 thru/or 

20wherein the above-mentioned user command interpretation element 

includes at least one word sequence given in any 1 paragraph. 

[Claim 22]The above-mentioned user command interpretation element is the 

control method of network equipment given in 1 paragraph to either of 

claims 1 thru/or 21 including a rule which defines that of arbitrary 

user commandsand mapping between concepts. 

[Claim 23] A control method of network equipment of claim 1 thru/or 
22wherein the above-mentioned user command interpretation element 
includes a user command interpretation element for other productive 
wordsand the same information as a related pair of a user network 
command given in any 1 paragraph. 

[Claim 24] A control method of network equipment given in any 1 paragraph 
of claims 1 thru/or 23wherein the above-mentioned user command 
interpretation element includes standardized information about a 
function of a category of apparatusand/or this apparatus. 
[Claim 25] A language of network equipmentor a language of the above- 
mentioned audio stationThis above-mentioned audio station to one 
received user command including a reasoning part by this reasoning part. 
A control method of network equipment given in the any 1 paragraph 
according to claim 1 to 24 transmitting several different user network 
commands to 2 or two or more network equipment based on information 
about a category and/or a function of network equipment. 



DETAILED DESCRIPTION 



[Detailed Description of the Invention] 
[0001] 

[Field of the Invention] This invention relates to the voice interface 
which can extend a vocabulary dynamically and actively in home network 
environment based on the vocabulary depending on the apparatus or the 
medium transmitted from network equipment. Especially this invention 
relates to the control method of the network equipment which can extend 
a vocabulary in the audio station which realizes a voice interface etc. 
Hardwaressuch as a videotape recorder (VTR)may constitute the device 
which applied this inventionfor exampleor softwaresuch as an electronic 
program guide (electronic programming guide)may be sufficient as itfor 
example. 



[0002] 

[Description of the Prior Art] In home network environmentthe apparatus 
which transmits the vocabulary and voice interface which describe their 
function to an audio station is indicated by European patent 
publication-bef ore-examination EP-A-97118470 No. An audio station 
changes into a corresponding user network command a user' s utterance 
received and recognizedand controls this apparatus based on this user 
network command. 

[0003] Motorola (Motorola) exhibited the language relevant to the Vox 
markup language 1. 0 (VoxMLl. 0) based on extensible markup language XML 
(below Extensible Markup Language: calls it XML.) in Septemberl998. This 
language is used for describing a dialog (dialog) system by specifying a 
prompt and the dialog step which generally consists of a list of 
available options. By hypermedia description language HTML (below 
Hypertext Markup Language: calls it HTML. ) with a rich text. According to 
VoxMLl. Odescription of voice application becomes easy so that a picturea 
hypertext linkand graphic user interface input control can be described 
easily. This VoxMLl. 0 includes the information about the pointer which 
directs other references which are useful when developing the 
composition of the syntax of the attribute of two or more elements and 
these elementsan examplea Vox markup language documentor a dialogand the 
application which uses a Vox markup language. 

[0004]Similarly by the Euro speech 97 Europe Acoustical Society of 
America-sponsored [ in the low toss island in Greece ]. "The paper 
announced by Richard SUPUR0T0 and others The markup language for text- 
to-speech synthesisISSN. 1774 pages of 1018-4074. "amarkup language for 
text-to-speech synthesis" by Richard Sproat et al. ESCA. Eurospeech 
97Rhodes[ Greece and ] The voice text markup language (spoken text 
markup language) (STML) is indicated by ISSN 1018-4074 and page 1774. 
This STML provides a text voice (TTS) composing device (text-to-speech 
synthesizers) with the knowledge about the composition of a text. For 
example in a multilingual TTS systema STML text sets up 
languageinitializes a speaker to the languageand a suitable language and 
a speaker specific table are loaded. 

[0005]Philips (Phillips) developed the dialog description language 
called HDDL. HDDL is a language for the dialog especially used in an 
automatic inquiry system. A dialogue system is sold after being created 
by offline mode using HDDL. 
[0006] 

[Problem(s) to be Solved by the Invent ion] Drawing 3 is a figure showing 
the conventional audio station 21. The audio station 21 is connected to 



the microphone 22the loudspeaker 30and the bus 40. After the input 
signal of the microphone 22 is processed by the digital signal processor 
(DSP) 23 which built in the memory 23ait is supplied to the central 
processing unit (CPU) 24. The result of an operation is outputted to the 
loudspeaker 30 via DSP29 which built in the memory 29aor CPU24 outputs 
it to the bus 40 via the link layer control section 25 and the I/F 
physical layer part 26. DSP23 and CPU24 can also access the memory 28. 
All the information required in order to control processing of an input 
signal is stored in the memory 28. DSP23 accesses the feature extraction 
part 27e of the memory 27. All the information required for speech 
recognition and voice synthesis is stored in the memory 27. CPU24 also 
accesses not only each voice interface definition part and general- 
purpose voice interface definition part for apparatus #lapparatus #2and 
apparatus #3 but a recognition partand the secretary matter / phoneme 
converter 27f two or more apparatus and here. Each voice interface 
definition part and a general-purpose voice interface definition part 
are independently memorized in the memory 27. 

[0007]Thussince it had the general-purpose voice interface definition 
part separately while providing the voice interface definition part 
according to apparatusthe command which two or more apparatus shares 
needed to be managed individuallystructure became complicatedand the 
efficiency of processing or analysis was bad in the conventional audio 
station 21. 

[0008]Thenthis invention is made in view of the actual condition 
mentioned aboveand is a thing. 

The purpose is to provide the control method of simple and efficient 
network equipment of transmitting the function of ** and network 
equipmentand a voice interface to a network audio stationand operating 
the function of two or more network equipmentand a voice interface in an 
audio station. 

[0009]That isthe purpose of this invention is [ for controlling within a 
network the network equipment connected to the network by an audio 
station ] easyis quickand is providing the method of being supple. An 
audio station is a device which changes a user command into a user 
network commandand controls network equipment via a network based on the 
function of network equipmentand a voice interface. 
[0010] 

[Means for Solving the Problem] According to this inventionall the 
network equipment connected to a network is equivalent to at least one 
apparatus document which defines a function of network equipmentand a 



voice interface. Network equipment itself may be provided with an 
apparatus documentor it may be drawn up to the exterior of network 
equipment, a pair with a user network command relevant to a user command 
interpretation element of network equipment in an apparatus document — 
1 set — or it has two or more sets. An audio station receives an 
apparatus documentunif ies a received apparatus documentand creates one 
voice interface description. This voice interface description integrated 
is created based on a language of an audio station. Voice interface 
description can be referred to as a general-purpose document for all the 
networks. A user' s utterance command received and recognized by audio 
station is changed into a user network command based on all the pairs of 
a user command interpretation element and a related user network command 
included in a general purpose language. A user command interpretation 
and an execution element have a definition of a lexical element and 
grammarand a pronunciation def initionfor example. 

[0011] the [ European patent gazette ] — an audio station is fetched 
from a database which exists in memory storage or a remote which an 
audio station equips with an apparatus document as indicated by EP-A- 
97118470 No. 

[0012]After an apparatus document is received from network equipment at 
the time of executionadaptation of the general-purpose document may be 
purely carried out syntactically in an audio station. Network equipment 
may have two or more apparatus documents. Each of those documents 
describe a part of function of network equipmentand some interfaces. And 
among thoseonly a document actually needed is transmitted to an audio 
station. Such a document is a document which defines a part of function 
of a document or network equipment which described a function of network 
equipment using a certain languagefor example. When it judges with an 
audio station or each network equipment itself needing further phonetic 
function of network equipment of these eachan apparatus document 
corresponding to the further phonetic function can be transmitted to an 
audio station. An audio station carries out adaptation of the general- 
purpose document based on this further documentand generates a user 
network command corresponding based on that general-purpose document by 
which adaptation was carried out at the time of execution. 
[0013] 

[Embodiment of the Invent ion] Drawing 1 is a figure showing the audio 
station 1 which applied this invention. The audio station 1 is connected 
to the microphone 2and the loudspeaker 10 and the bus 20. After the 
input signal based on the sound inputted into the microphone 2 is 
processed by the digital signal processor (DSP) 3 which built in the 



memory 3ait is supplied to the central processing unit (CPU) 4. The 
result of an operation is outputted to the loudspeaker 10 via DSP9 which 
built in the memory 9aor CPU4 outputs it to the bus 20 via the link 
layer control section 5 and the I/F physical layer part 6. DSP3 and CPU4 
can also access the memory 8. All the information required in order to 
control processing of an input signal is stored in the memory 8. DSP3 
accesses the feature extraction part 7e of the memory 7. All the 
information required for speech recognition and voice synthesis is 
stored in the memory 7. 

[0014] In the conventional audio stationtwo or more voice interface 
definition parts corresponding to each of two or more connected network 
equipment and one general-purpose voice interface definition part were 
provided. On the other handthe audio station 1 which applied this 
invention has the only voice interface definition part integrated. This 
voice interface definition part integrated supports the general-purpose 
document described by the conventional general-purpose voice interface 
definition part. 

[0015] Drawing 2 is a functional-blocks figure of the network equipment 
11 which applied this invention. The network equipment 11 is provided 
with CPU12. CPU12 carries out an interaction to the software 15 for 
appliance control for controlling the network equipment lithe memory 
14the link layer control section 17and the I/F physical layer part 16and 
outputs various information to the bus 10. The interaction of CPU12 can 
also be carried out to the memory 13. According to this inventionthe 
memory 13 has memorized a voice interface definitioni. e. lor two or more 
apparatus documents. 

[0016] As mentioned abovethe network equipment 11 does not necessarily 
need to be provided with the memory 13 which has a voice interface 
definition part. The voice interface definition of the network equipment 
11 may be provided with a voice interface definition providing device. 
The audio station 1 shown in drawing 1 can be accessed at a voice 
interface definition providing device. 

[0017] The two network equipment 11 is connected to the network provided 
with the one audio station 1 in the example of this invention explained 
below. Two apparatus documents in which the general-purpose document of 
the audio station 1 defines the language of the two network equipment 11 
by empty are merged into description of one interface in the audio 
station 1 in first stage (merge). The number of the network equipment 11 
connected to a network may be [ two or more / arbitrary ] n. Based on 
the newly received apparatus documentadaptation of the general-purpose 
document of the audio station 1 can be carried out. Belowin order to 



explain brieflyan apparatus document shall have a user command 
interpretation element which consists of one lexical element. 
[0018] In the following explanationLl shows the lexical element and the 
corresponding command of an acceptance language (accepted 
language) i. e. the 1st apparatus. L2 shows the 2nd lexical element and 
corresponding command of apparatus similarly. According to mathematical 
expressionLl is a set of at least one lexical elementi. e. the user 
network command corresponding to word w 4 and this word Wj. Word Wj may 
not be a single word but may be perfect utterance which consists of two 
or more words. An acceptance language may be added to a lexical element 
and may also include the rule for the grammar for pronunciation and a 
word sequence and/or speech comprehension (speech understanding) and a 
dialogfor example. 

[0019] In the 1st exampleLl and L2 do not contain the same lexical 
elementi. e. a common word. Thereforethe acceptance language L which is 
L1**L2={} and was merged for interface description is set to L=L1**L2. 
That isthe general-purpose document in the audio station 1 is built by 
annexing the pair of the lexical element obtained from the apparatus 
document 1 described by the language LI of the 1st apparatusand the 
apparatus document 2 described by the language L2 of the 2nd 
apparatusand a corresponding command. Since LI and L2 do not contain the 
same lexical elementin order to suggest a thing [ as opposed to / 
command / corresponding / which apparatus in the lexical element ] a 
lexical element isa user network command is generated appropriately and 
is correctly transmitted to corresponding apparatus. 

[0020] In the 1st exampletwo network equipment is a television set and a 

CD playerfor example. At this timeLl corresponding to a television set 

and L2 corresponding to a CD player consist of the following lexical 

elements within an apparatus documentrespectively. 

[0021]L1={MTVCNN} 

L2= {reproduction and a stop} 

Since LI and L2 do not contain the same lexical elementi. e. it is 
Ll**L2={}the acceptance language L with which interface description was 
merged becomes L=L1**L2= {MTVCNNreproductionand a stop}. For 
examplethese lexical elements correspond with the user network command 
which has a function which the following lists are shownrespectively. 
[0022] 

> reproduction which switches CNN[ <switches television to MTV> ] ->< 
television to CNN -> carrying out only > stop -> <a CD player is made 
into stop mode) which makes < CD player reproduction mode. [ MTV-> ] 
When sharing the lexical element with two same apparatusi. e. the same 



wordit becomes L1**L2 !={}. According to this inventionit is 
identifiable also in a vocabulary common to these. The name of apparatus 
is prefaced or postposed in the 1st example of this invention by the 
same lexical element between each acceptance language (i. e. between LI 
and L2) that forms an intersection at least, thereforea user — before 
the name of each apparatus of a request of a user' s utterance command — 
and/orit must add back. As mentioned abovewhen a command is not commonit 
is not necessary to add the name of apparatus butand it may add. 
[0023] In the following example [ 2nd ]the name of apparatus is added 
before each command and the new language L which forms description of an 
interface is shown in the language of the apparatus by which annexation 
of a word without a possibility that it may be mixed up between the two 
languages LI and L2and the name of apparatus were prefaced. It will be 
set to L=L1** (L1**L2) **L2** (L1**L2) **nlLl**n2L2 if the name of 
apparatus is set to nl and n2respectively. 

[0024]Belowan above-mentioned example is described to an example for a 
network provided with a CD reproduction device and a tape reproducerand 
the control method of the network equipment by this invention is 
clarified. Herethe CD reproduction device and the tape reproducer are 
named "CD" and a "tape"respectively. The lexical element which 
constitutes the acceptance language LI of a CD reproduction device and 
the acceptance language L2 of a tape reproducerrespectively is shown 
below. 

[0025]L1= {reproductiona stopand a skip} 

L2= {playbacksound recordinga stopand rewinding} 

The acceptance language of a voice interface is L= (L1**(L1**L2) ** (it 
becomes L2**(L1**L2) **nlLl**n2L2= {a skipsound recordingrewindingCD 
reproductionCD stopCD skiptape recordinga tape stopand tape 
rewinding}.)). The function of a user network command to correspond is 
as follows. 
[0026] 

Skip -> > sound-recording -> [ </ which is skipped to the next track of 
CD ] <a tape reproducer is made into sound recording mode) Rewindand it 
skips to the next track of CD skip [ <suspends playback of a CD 
reproduction device) ] -><CD. [ CD stop / <makes a CD reproduction 
device reproduction mode) / -> ] [CD reproduction / <rewinds a tape) / 
_ > ] [ _ > ] The tape stop to carry out and <makes a tape reproducer 
sound recording mode) -> tape rewinding [ <which suspends playback/sound 
recording of a tape reproducer) ] -> <a tape is rewound) [ tape 
recording -> / <makes a tape reproducer reproduction mode) ] [ > tape 
reproduction -> ] [0027] In this 2nd examplewhen the recognized command 



is equivocalthe same word problem (same words problem) asks 
automatically to a userand is solved by making the apparatus for which 
it asks to a user specify. Although the scenario in this case is the 
same as the language which has the distinctiveness in the 1st example 
formallythat interpretation changes. The 3rd example that clarifies this 
example that specifies the variation of this interpretation is based on 
the scenario which is based on the 2nd examplenamelywas given to the CD 
reproduction device and tape reproducer as network equipment. The 
acceptance language L is formed like the acceptance language L of the 
1st example in that a selection process is performed when recognizing 
the lexical element contained in the common area of the acceptance 
language LI and the acceptance language L2. That isit is L=L1**L2. The 
lexical element which constitutes the acceptance language L of an audio 
station under the conditions of having the lexical element with 
respectively same the acceptance language LI and the acceptance language 
L21ike the 2nd example becomes L= {playbacka stopa skipsound 
recordingand rewinding}, the user network command to which it 
corresponds in this case is a playback ->< clear imitation — it is a 
#1> stop ->< clear imitation — a sound recording [ <is skipped to the 
next track of CD> ] ->< tape reproducer is made into sound recording 
mode — it > rewinds and is -> <a tape is rewound). [ #2> skip -> ] here 
— " — clear — an imitation — #1" means that a voice recognition part 
outputs an inquiry called the command or ("on what device") to which 
apparatusand switches the acceptance language L to acceptance language 
La. La consists of the lexical element La= {CD and tape} and the function 
of a user network command to correspondlt is CD-> <a CD reproduction 
device' s is made into reproduction mode>and <switchbacking to L>and is 
tape -> <a tape reproducer' s is made into reproduction mode>and 
<switchbacking to L>. 

[0028] here — " — clear — an imitation — #2" means that the audio 
station 1 outputs the inquiry to which apparatus whether it is a 
commandand switches the acceptance language L to the acceptance language 
Lb. Lb consists of a lexical element indicated to be Lb=La= {CD and 
tape} and the function of a user network command to correspondlt is CD-> 
<a CD reproduction device' s is made into stop mode>and <switchbacking to 
L>and is Tape-) <a tape reproducer' s is made into stop mode>and 
<switchbacking to L>. 

[0029] In the 3rd example of this inventionthe control management of the 
audio station shown in the 2nd example can also be shortened. In this 
casethe audio station 1 considers that the apparatus used immediately 
before is default apparatusand changes into a corresponding user network 



command a user' s utterance command received and recognized based on the 
general-purpose document which consists of the acceptance language L 
with which the audio station 1 and default apparatus were unified. Also 
when controlling again other apparatus other than the apparatus used 
immediately beforethe control management of an audio station can be 
shortened. In this caseit directs to control apparatus to an audio 
station by only emitting the name of apparatusfor example. 
[0030]Adaptation of a dialog is performed in the 4th example. In the 
above situationsin order to assign an equivocal word correctlyan inquiry 
of apparatus clear statement is outputted to a userand a system 
recognizes a user' s reaction to the inquiry. When the great portion of 
reply of the user to the inquiry for apparatus clear statement is the 
apparatus specified immediately beforean inquiry is skipped and it may 
be made to carry out direct supply of the user network command to last 
apparatus. In order to predict to which apparatus the command was taken 
outtwo or more sources of information may be used. For exampleit not 
only takes into consideration the apparatus used immediately beforebut 
it takes into consideration the points of comparison of each apparatusor 
a classification of apparatus. For examplea record command has a high 
possibility of being not the record to an audiotape but record on 
videotapewhen being taken out while the user is watching television. The 
record command taken out on the other hand while the user is listening 
to radio has a high possibility of being record to an audiotape. Thenit 
may be made to choose the high network equipment of a possibility of 
being used from the network equipmentin which the power supply is 
switched onfor example. A classification of apparatus can also be 
learned by investigating the description about a user' s act or the 
function of apparatus. 

[0031] In all the above-mentioned examplesafteras for the rear stirrup by 
which network equipment was connected to the networkapparatus receives a 
user network command from the audio station leach network equipment can 
transmit an apparatus document to the audio station 1 directly. The 
apparatus document etc. which became independent of the present status 
may be transmitted to an audio station. Apparatus may have two or more 
apparatus documents. Or apparatus may update a dialog and voice 
recognition capabilitywhen changing a function dynamically based on 
change of contents information (contentinformation) and accepting the 
change. Or a function may be periodically changed by transmitting a new 
document to an audio station. An apparatus document may be storedfor 
example in an apparatus document providing device like a device 
manufacturer' s internet serverwithout being stored in corresponding 



apparatus. In this caseif the audio station 1 recognizes it as the 
above-mentioned apparatus being connected to a networkit may download an 
apparatus document. 

[0032] Apparatus can have two or more documents including the acceptance 
language of the whole apparatus by the language of the country where 
each differs. For examplesuch apparatus transmits a German apparatus 
document to an audio station first. Thenafter receiving a command 
corresponding from a useran English apparatus document can be 
transmitted to an audio station. Therebythe audio station can change the 
user command of either German or English into a user network commandand 
can control apparatus. 

[0033]The definition of the pronunciation for the word used when 
performing recognition or composition other than a lexical element may 
be included as an element of an apparatus document. And the definition 
of those pronunciation may be directly unified in the general-purpose 
document of an audio station. When the definition of pronunciation is 
included in languagethe definition of those pronunciation may be used 
for one recognition for apparatusand a synchronizer for exclusive useor 
all the apparatus may share itfor example. That isthe definition of the 
pronunciation introduced by the 1st apparatus can be validated also to 
the word emitted to the 2nd apparatusi. e. other apparatus connected to 
the network. 

[0034] In order to compoundtwo or more apparatus documents may be unified 
and adaptation of a general -purpose document may be performed in an 
audio station. The reply to an inquiry may be performed not only to one 
apparatus but to two or more apparatus. It is necessary to guarantee 
that composition of the utterance from two or more apparatus is not 
outputted simultaneously. Some utterance is produced according to the 
phenomenon of the exterior like warning based on a supply priority. For 
exampleafter those utterance interrupts other utterance and a user' s 
inactive period expiresthe interrupted utterance is reintroduced and a 
dialog continues. While the stream considered that the likelihood which 
is in agreement with a user's input and a user's utterance is high is 
continuingthe stream of two or more dialogs is paralleland is managed on 
the assumption that the fact that all the streams are in an active state. 
[0035]The general-purpose document of an audio station may have a basic 
set of the element which may be empty in first stage or describes the 
interface for 1 or two or more network equipment. In the case of the 
latterthe document transmitted to an audio station from apparatus does 
not have concretenessbut refer to the general-purpose document for it 
selectively. 



[0036]For exampleonly the dialog grammar for specific apparatus may be 
specified in an apparatus documentand the default grammar for the 
expression of utterance may be included in the fundamental set of the 
user command interpretation element of a general-purpose document. A 
certain function may be thoroughly described in the general-purpose 
document of a voice recognition part. For exampleit is described how an 
electronic program guide is controlled by a voice dialog. In this 
casethe variable which shows the information about a program namea maker 
namean actor namea time zoneand a broadcasting day is included in 
default grammar and the grammar for a series of wordsfor exampleand the 
document transmitted to an audio station from an electronic program 
guide device includes only the information by which these variables are 
fulfilled. 

[0037] In other examplesthe apparatus document transmitted to an audio 
station from apparatus contains the keyword and category identifier 
corresponding to a network command. An audio station determines which 
grammar about a word sequence is applied to utterance of the continuing 
user who may be generated based on these category identifiers. For 
examplewhen a keyword is "Voice of America (voice of America)" and a 
category is "radio"a userlt generates saying "I liking to hear Voice of 
America (I want to listen to voice of America)" or "one [ Voice of 
America ] (Turn on voiceof America)." When a category is a "radio 
cassette recorder recorder"utterance of the user who may be generated is 
"recording Voice of America (Please record voice of America)" etc. 
[0038] required (time8:00) in order to start the group of the 
concept/value which forms the foundation of speech comprehension in 
order to perform adaptation of a general-purpose documentfor examplea 
certain processing — it may be made to contain in the apparatus document 
to which the said group is transmitted by the audio station as grammar 
for apparatus As for this rulethe apparatus document which describes a 
voice interface defines [ mapping between a word sequence and a 
conceptand ] how a variable value is buried including a rule. An 
apparatus document has a rule which defines mapping to the operation 
from the group of a couple for a concept/value. For exampleit goes away 
with a (command and record) (title and a windand a group called **)and 
(time and 8:00) expresses the picture recording processing of VTR. A 
dialog may be begun from the user sidenamelythe turn of utterance is 
good also considering a user as initiative. "8 o' clockthe turn of 
utterance is left with a movie and a wind and is **" etc. for example. 
When information is given by 1 or two or more utterancethe turn of 
utterance is left to a user. A system collects information required in 



order to put operation into operation for exarapleby the user network 
command transmitted to each network equipment based on grammar. A user' s 
utterance which this system transmitted the further document that 
includes additional grammarfor example to the audio stationthe audio 
station could ask back information to the user when information was 
missingand has been recognized newly can also be assigned to the further 
user network command. 

[0039] An apparatus document as a user command interpretation element 
Grammar for a lexical elementpronunciationand a word sequenceBesides the 
information about the rule for speech comprehension and a dialogthe 
information for the grammar for the same information corresponding to 
two or more languages assigned to a same or equivalent user network 
commandi. e. a lexical elementpronunciationand a word sequencespeech 
comprehensionand a dialog which carries out rule Seki may also be 
included. The user can control apparatus by this information with any 
languages. As mentioned abovethis information may be included in two or 
more apparatus documents which the same apparatus has. The interface of 
one audio station which can realize processing with the sound of two or 
more languages by this can be specified. Since the pair of the above- 
mentioned concept/value is language independencegrammar of a dialog can 
be considered as language independence. 

[0040] As mentioned abovean apparatus document contains the vocabulary 
given clearly or suggestively by the grammar for utterance. 1 or two or 
more pronunciation can also be arbitrarily given to each lexical 
elementi. e. each word. When arbitrary pronunciation is not given to an 
apparatus documentthis may be generated automatically. Howeverwhen the 
special pronunciation in the case where especially words are a proper 
nouna foreign wordand an abbreviationa dialector a foreign language must 
be recognized in this caseit is easy to generate an error. Pronunciation 
serves as the foundation for generating the model of a word in the 
recognition part to which the set of the model of a single sound or an 
allophone was given. 

[0041]An apparatus document besides the user command interpretation 
element which creates the compound command about two or more available 
apparatusand a related user commandThe information about functionslike 
"the standardized informationfor examplethe information about the 
category of apparatus like "VTR"and it has a recordable tape" may also 
be included. For examplea user emitssaying "record this movie (Please 
record this movie) "when choosing the movie of the schedule broadcast in 
the future from the electronic program guide displayed on a television 
set. That isa network command needs to transmit the information about a 



suitable channela dateand a time zone to VTR from a program guide device. 
And it is necessary to transmit the command which guarantees that 
suitable recording is performed to VTR. As a situation similar to thisa 
movie may be dubbed from a certain VTR tape to another VTR tape. The 
voice interface description of each apparatus does not usually define 
the utterance about such utterancei. e. a suitable channela datea time 
zoneetc. by such a situation. Thereforean audio station may carry out 
adaptation of the general-purpose document based on reasoning by a 
reasoning part. The reasoning part is contained in the apparatus 
document as a reasoning componentand provides the additional function 
which may be included in one of the compound command about two or more 
apparatus with description of the voice interface containing the grammar 
and the dialog about a word sequence. 

[0042]Since all the information for controlling network equipment is 
included in one general-purpose document in an audio stationit can 
conduct processing or analysis easily. It is easy especially when 
[ especially ] relation is between the double information assigned to 
the same or equivalent user network command between the available 
contents from which a user command interpretation element diff ersor/and 
in a general-purpose document. 

[0043]Thereforeaccording to this invent ionnetwork equipment controllable 
via the audio station with which a network is provided can transmit the 
apparatus document which describes the function of apparatusand a voice 
interface to an audio station. An audio station unifies an apparatus 
document in a general-purpose document. A general-purpose document forms 
the foundation for changing the recognized user command into a user 
network command using a user command interpretation elementin order to 
control the connected network equipment. An apparatus document consists 
of a vocabulary like for examplea user command interpretation element 
and a related user network command. The rule for the grammar not only 
for the same information as these corresponding to two or more languages 
or the information about the dynamic dialog in speech comprehension but 
pronunciation and a word sequencespeech comprehensionand a dialog may be 
included in the user command interpretation element of an apparatus 
document. One apparatus has two or more apparatus documentsand may 
transmit them to an audio station dynamically at the time of necessity. 
Since network equipment transmits the specification about a phonetic 
function to an audio station at the time of executionapparatus can 
change the function dynamically by applying this invention based on 
change of the contents of the apparatus document. When apparatus changes 
the function dynamical lynetwork equipment generates an apparatus 



document dynamical lyor updates the existing apparatus document. Updating 
is performed by updating or inserting the name of a broadcasting 
stationfor exampleand network equipment transmits the updated apparatus 
document to an audio station. 
[0044] 

[Effect of the Invention]As mentioned abovein the control method of the 
network equipment concerning this invention. Correspond to network 
equipment and at least one apparatus document including the language 
which consists of at least one pair of the user network command relevant 
to a user command interpretation element is receivedadaptation of the 
received apparatus document is carried out to the general-purpose 
document which consists of a language of an apparatus documentand a 
language of the audio station constituted similarly — adaptation being 
carried out andA user' s voice commanding received and recognized is 
changed into a corresponding user network command based on all the pairs 
of a user command interpretation element and the related user network 
command included in a general purpose language. Therebyaccording to the 
control method of the network equipment concerning this inventionthe 
user network command which controls two or more network equipment 
connected to the network provided with an audio station can be 
managedprocessed and analyzed simply and efficiently. 



DESCRIPTION OF DRAWINGS 



[Brief Description of the Drawings] 

[Drawing 1] It is a block diagram showing the composition of the audio 
station which applied this invention. 

[Drawing 2] It is a block diagram showing the composition of the network 
equipment which applied this invention. 

[Drawing 3] It is a block diagram showing the composition of the 
conventional audio station. 
[Description of Notations] 

1 An audio station and 2 [ Bus ] A microphone3 DSP4 CPU5 link-layer 
control section6 I/F physical layer partand 7 A memory and 8 A memory9 
DSPten loudspeakersand 20 
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[«WO«r*fiBllf»B3 *HB^li. *-Zx*vh"7- 

Tiatittf«:jsifi-r«ctoT»***m-r>*-7i-rx 
«Hio*ijtt&aicBr*o *«w*afflLfc*Btt. 

X\f\£7tT-7\s^-¥ (VTR) HWA-K^iT 7 
K (electronic programming guide) H©V7h7i7 
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[«aEOa«f] R#H1tlM5B4aBE P-A-9 7 1 1 8 
©^tl^IE^-r^lgftSO-^Y >^-7 i^XHp 

8«(c3iflrr««Hi!B«BwS'nTi' k *o s^ga^i^ § 

[0 0 0 3] 1 998^9^lC, =E b □— 5tt (Motoro 
la) tt, ffi3iRltgT-^7 7 -y^SfgXML (Extensible 
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ext-to-speech synthesis" by Richard Sproat et a I. 
ESCA. Eurospeech 97, Rhodes, Greece, ISSN 1018-407 
4, page 1774) Kli, ^£^+3. h^-^T'-y 7°s-|§ 
(spoken text markup language) (STML) j^Ht^ 
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Xt°-*1 0, /tX2 0tCjja**tlTV*. v<7 
□ * V 2 IE A73*tlfc«*lC*-3 < A7Jfl»tt» * =E U 
ZatepmLltrVZlWyi-iV-fu-kv*)- (DSP) 

3T»«ffl**ifc«» ^^Siaa^a (cpu) 4K« 
is* n^o c p u 4 «: % amess* ^U9a 

LfcDSP9*^LTXtf-*1 0lcai73U 
li, UV^KttWWSStf I / FUfflK+Se* 
^LT/U2 0tCtt7J-r*o DSP3&tfCPU4tt» 
^yaicy^-teXTSCifcT?**,, /tysfctt* 

SSKU DSP3t* % ^^';7(0^«W* 
a»7e(C7'^-lrX-r*o *ty7(«:W\ ^PlSIi&tf 

[0014] ^©w^KT-ii. *«*nfciffl«)* 

[0 0 15] H2& *aW*afflLfc*y h 7—7$ 
1 W\ CPU 1 2*mX2>o CPU 1 2tt % *v h7- 



oicaj7j-T5o Ti-eic. cpui2W\ ^t'j 1 
*y 1 3tt» sp-r>*-7i-rxs«r4t)-6» ix 

[0 0 16] ±&<D«fc5te % *y h9—7«l§1 Hi, 
Lfc*7LTlvS:<Tt>J:l\, *y h 7—7888 1 1 ©W 
JlttgStC^yjSttLTt^lA, ^Y>^-7i^X 

[0 0 17] wTic«iwra*aiwo*M«!Hcfei , 'T 

W\ 1 OO^pgSI h7— *lC2 0©* 

PgB 1 ®sAfflX»ttffi?» 2 OO** h9-^«S 1 
1 ©Wf§££ilT3 2 oo«»x»W\ WP£B 1 rtT 

81 1 £DlSt±. 2W±0)ff«ODnffl?»oT'b«fcL\ $ 

^pgai©;AfflS:»^iffcii:§fi?nfc«^» 
ic«^T»&fb*-a i-XTT«, wr& 

[0 0 18] JilTOWWIfcfi^T* L 1 (* % S3«1i 
(accepted language) SB 1 <D«H|0!)ffift5 

*Sl»*JSr*3V>K*/T5ro H*lcL2tt. §12 <D 

«»oiHHsi6ai«*(sr*3^>K*^ro 

SWCfetitf, L 1 It. <bfc 1 O<0H»S*, 

to-SJIlHwitCOJIllBwituWjS-rsn-lf^y r-7- 

*H5UO/-ci60SS6atf/XW:if|5aj» (speech 
understand i ng) Stf^fSOfcfe^^U^^A/Tfe J: 

[0 0 19] S§ 1 (DmMmiZtS^Ttt, L I <h L 2tt, 

iibT, LinL2= {} TSt;. -0*-7i-fXlB 
a©fci&lcT-5>**lfcSa»H L (4 % L = L 1 U L 2 

^St^SfgL 1 fc<fcyiB»*ftfc*HIX»1 <k> ^2(D 
«IIS<7)slgL 2 ltd; y BSfi*nfc«MIX»2 ^6f»5*l 

y n^o l 1 1 l 2 im irsfftB**^**^® 

T\ fg^^ti^r S □ T > K <t <h tc^©fgftg^ 
-1f*y K7-7=i?>Ri, awtcSrtStiT, »J6 



S5„ C©<!:^ 7 L ues>a>-tz-y hlC^T^L 1 «*: 
C D 7U— WcttJ&TS L 2 tt^-n^tu WBWfc? 
WT©»*B*frS4*. 
[002 1] L 1 = {MTV, CNN} 
L2= {»£, ffit} 

L 1 <k L 2 tt. If] i;H»MI6*^Sftt\ -r^=b"B L 1 

ni_2= {} Tifcs©?, -r>*-7i-rxiaa©7- 

v''*n/c§SsfiLti, L = L1UL2= {MTV, C 
NN, #it} t^^ 0 £ttb©i§*gf£ 

[0022] 

MTV^<xbt^a>£MTVK«y&7t3> 
CNN^<xUei>a>£CNNKWy&;tS> 
BS-»<CDyu— \»*Bft ; E-Rc-r*> 
fflt-»< C D -} Is— V^±=E- RcT3> 

L/rL&tfs, 2oo«»fl«ipii:iaHBifrftt)-BiRii: 
cfctii*; cns«aoiaK.K»jRri6?*«. **»© 

&=bt5 L 1 t L 2 fg©'J>& < £«fefta»#«B«T*H 

i;ffift££fcirBxi*ftB£ft*. Ltct^x. a-v 
tt, n-»f©»is=]^> K*ma©a«aw)*»©m(«: 

[0 0 2 3] «T©B2©*JHM£;J3^T-W\ «HKD« 

fStt*ziTV K©tticttijn;!rtu -f x©f3 
asj&ar **? li^to l « % 2 o©wfg l 1 1 l 2 © 

IB7«ra*n*fi*n<0ft^#»i!)«f^i:«M|(0:&»tf 
tm*ftfc«»©»K?S7!rft*o iiS©;&fl>£*-*l? 
nn 1S0 : n2<fr§<t, 

L = L1\(LinL2) UL2\ (L1flL2) Un 
1 L 1 Un 2 L2 

[0 0 2 4] J-XTTt^ C DSSSBStfx-TWSS 
B£ffi§*3*-y r-^-^GOU:, ±2E©ltffitt€ffiE|ll 
U *»W(i:«fc**y H7-*««©IMW*SS*iB5fr 

tlftU TCDJ, rr-^j i^jntt^o CD 

h££b©53*i&l 1 <t^-ys^garo§gmiiL 

[0 0 2 5] L1 = {«£, ff±. 
L2= {«£, £1^, ffct, MSU 
*P-r>*-7i-fX©Sa»«tt, L= (L1\ (L 
1DL2) U (L2\ (L 1 HL2) Un1LlUn2 
L2= U**X StUU CDSS, CD<| 

it. cdx**?, x-tbb, x-^±, 



commit. j-xT<7)<fcatc^s 0 

[002 6] 

X* v C D ©*© h ^ «y v S > 

C DS£-»<C DB^SBfcBftt- KlC-T*> 
C D#it-K C DB£»B0»£*#itf *> 
C 7->< C D©*© h77 ^X+v ^r§> 

T-7B£-*<r-7B4gl?SS : E- Rcr*> 

> 

[0 0 2 7] C©B 2 ©*««?«* ^-JjiffifflB (sa 

me words problem) li* SSBStlfcUW Ktf^BWT 

n^o y*tt» matttcttx fgi©*sfe 

fltea3tt*B»Jtt£*rT* sift HO *©&? 
fRtfBto*, il«©/('Ji->a VfcWjSTSC© 
BifiMfcHHiUcT SS& 3 ©HMOl*. B 2 ©BffiflQKS 

^Tfiy, rftt»-B, *v h-7-^BBtLTocD 

^ti^o ssmigui, sastgL i tssmigL 

2©«W»»(«:^sn*HBB«€BKr*t*tcaR 
JGStffrfeft* <t ^ a jS?(*. B 1 ©£ffig|©S3ffffi 

Ltraa»cm«*n* 0 rftt)-5, l=liul2t* 

So «2 ©*iHM©«fc 5 fc, §SbI§L 1 tSSSIiL 
2 *-ft*ftlRl IJH«B*&%T* il^*fM)fc£ 

t\ wjssbcssh i§ l *m$tfz>mmmmz. l = 
{#£, #jt. T&v-j. am. mzmu c 

SS-»<B^5ttc-yr«fc# i > 
^±-><B^5ltJ:-S<};#2> 

©#© h 5 7 <7 y Tt Z> > 

if©«IHcS*-r*aT>K^ ("on what device") 
<B»J««il^C4*ii*t5. Lali, La= {C 

*y r-7—?=lT> K©«l6tt» C D-»<C DB^SB 

^B^t- K^TS>SO : < L ICX'T >y x/\'y ^tS> 

tfty» x-r-x^-^sflfeaBtiist-Rcr* 

[0 0 2 8] SfcCCTs rB^cstc-arj: # 2 j tt, ^ 
SB1 ^©«HHc*fr*n-e> K^tl^dlBi^to-a: 

^ai73LT^ asw§L*aawBLbtc:«iy»**t 



>y h7-*37V K©B«W\ C D^<C DB£gB£ 

»jt : E-Rc-r*>atf<Hcz-fy^/^y^-r*>t 

fctK Tap e-Kx-yfS^gfifcffit*- RcT* 
[0 0 2 9] *«WOlS3©*M«yi«:fi^Ts 31 2 ©H 

#§o c©«^ w^sbi fcmffium^snfcBs* 

Sn— if*y h7— 5^3^>RcB»r*o EBfcBl^ 

|S»BOW»«Bl*Jittr*Ci:fl<T*«o C©«^ 
«jttfiNcBS0***«-rSE£fcJ:oT\ Bf§£§iJ 
»T*<fc?B»BBteftiS*-*o 
[0 0 3 0] »40**«7?tt, ^-ry'n^OBJSfbtf 

r*iL-lfOfiK*B«r*«, «BW^Ofc46©Bl^ 

BBTfcSB^ Bl/^trtsSrS^+yyU a— «f*v 
h 7 - * ire > KSBttOBBKBttttg-r * <fc 5 tc L 
Tfc<fcl\ □ TVhWOBSfcttLTttiSftfc 

mi*; Bwtcffiffl* tife«»**ar < * 
*B»oa«wsxtt«»®»a«b*jir«o b*i& is 

Ri, a— ytfxUfcrSBTl^WJciil^n* 
ii^ti, *x~7'\©fE§lTHi& < , bfT*x 

*x-7*^©lBB?**RrtltttfBl\ fCT, 
ftSRTBttCBl^v h7-*«B*» ffil^idfBBtffi 
A*nTVS*v h ^-^BSfr 5BKT* J: 5 KLT 
fc«fcl\ $fc» B8<Z)£9!& n— WDfri&XttBS© 

bbkbt *iaa*BBf * c t ic <fc q t^b-t * £ <t 

[003 1] ±a©^T<3!)**««K*^T» *V h7- 
*BStf*v h7-*ltfe«*nfc»Xtt, ^fcS&tfa- 
+f*y KfcBWKBl frSSBLfcB 

bssbtse w&qzt-zz&z 

aiL/-c«B*»»*B^»BKa6fllLTt)«»:t\ BB 
tt, «a©BBS»**LT^r*.«fcl\ Sfcl*. BB 
H\ AB1KB (content information) OBStcX^T 
»«fcBtE£B*T* Zomtzm&ZtZlzV'fTa 
^atfBmB«B**JEBfLT'E»«fel\ StU 
^XBtcBmBBtcaMrr^CtfCfcoT, £BttKB 

*fc*7*.5 -7-± M •> tffcSBTV*/+ *fr*--a-z.**BErhlri*4ih 



BAfcRBBHJfiB©-r>*-*y h+t-M'© 

x^ztwmrzts bb&*£ > p - k lt*. <fc 
[0032] bb». *n?ntf*ft*B©mnc«fc* 

B8^»<0SBBH**fc«»OXB*^-r * 3 i ? 

flfclf* £©<fc?&8l&ti. BWlchVyBOB 

r«3^VK*S«Lfc»lc, £ffi<DBB&B«SP£ 

wizxzmTzztftT^Za c*Ucj:y, 

*>y h7—?^T> KfcBBLTB8*M»*-3Citf 

[0 0 3 3] ffiftBXaf&t^ BBXtt^J8*fr3t* 
lcJB^Sft*H®fctt©«B<Dj£B*BB:fcB©BB£ 
LT^S*Tt>«feL\ ^LT, *nS©B#©J£B*S 

TOfcSSn**:*, *ft5©«S©£»W:x Billf, 
1 o©B8ffl©B«Rtf&l8»U:»M;:B^Tfc«fc<, 
Xtt, £T©«8?««LTfcJ:lA, Tftfe^ BIO 
Billed; oTBAS-illfcBBOSBf*. 8l2©t&&. t 

[0 0 3 4] £j**?f3fc4&lcu «»©B8&B«1ft£ 
U «SBBfcT\ 3UBXB©BlSfb*fToT J fcJ:i\> 

©BSK:»LT?T-3T*«fcl\ «»©BBfr5©BB® 

*o RIS©*leW:* «IS«ft«n:*^T, B?t(«1& 

^■ftSOBBtt, f60BK(c|ijya*» n-tf©p-SS) 

fflBflm>*t, iiyasnfeBiStt!i«A*nx ^ 

7P^(*«HT-r*o a— yoA^atfa-yoBBK- 

*b* ^TOAhy-ix^^x-f^attBtcabst^ 

a^*^«F*l<»:LT, BRO^'fy'P^fflAhy-^ 

[0 0 3 5] ^gS©;RfflX«tis ffifflWlCli^T'S 
oTt».i:<, **^tt. IXliBft©** K7-*B8 
OfcttflDs •0*-7i-rA*IB3ar*B«0»*-by 

h*^LTi"Tt.«j:i\ b#©»^» mm^nmm 
^ic#Hg-r 5 o T t <fc l\ 

[0 0 3 6] BAtf» »SOBS0!)fc46ffl4f'r7 , P^S: 
»flD**BSX»fc£l/»T1$£ L> fgfSco a ^0 L©fc 



;UhXSatf-»Oi|l»3!)/S:»fl!>Xai!:W:» « 
KfESfc* tiNR* BSIBWaif8c3£BlEHT*1f 
a*wT*tttf^StlT*ys K»BfrS 

[0 0 3 7] fftOSMMStcfe^TW:, M»»6f^l 
fc^*ft*«»X»a:» *v h7-*PT>Rc*f)£ 

ttraa-yoRKtcWLT, *h$uu:bt*£®x-s 
#0*1*, *-7- k# w 

X-f?'7)< l Jj3 (voice of America) j T*S l A * 

*7*« TVUft^K^/cl/ 1 (I want to listen to voic 
e of America) j Xti • • TV 'J££:t> 

(Turn on voiceof America) j Wt96£.T&o Sfc» 

4 U§* a— «f oaBtt. rtf'-r X • * 7 • T 7 * U £ £ 
tUta (Please record voice of America) j WZ'&'bo 
[0 0 3 8] 5Wl»0ffl(fflb*fT5fc4&te % W^Sft? 

rafttticiEsss: am 8 : oo) 

SB(cS£ffl« nSMttXttKBiftHOS&J: LT^Stf 

Ttj;i\ f^>^-7i^x fctBiE-r *«sx» 

■y £>?»#£©«* 5 (csft««si6«^«£«r «. 

^•yt;>^*s«-r*«»j**r*o (pv> 
k, ibs) , (*-rr-;k s£i*,u:sy*M aif (ps 

$J, 8: 00) £l>5fiW\ VTR©S§iE$a3£81-c, 
%B©IH«li« a— tf*±»t LTti«fel\ #§f£©IH# 

tit. mtui. rs^ wm. mtt&ic&j&i 
tf, xmmrSttLT&z-v h7-^^s§ic^fi*ti 

fca-1f*v r-7-^=l^> Kfc<fc^T«rflF**&irr* 
U £*£B& ItatffcSLZV**^ =L-mzffi 

[0039] =L—*fzi^ym%mm£L 

Tx B*SJ& *H5U©fci&©XSfc WPSft?K 
t/?<7 7 P?©fc46©&MUlC|?ar5lfa©fteK, B-X 
ttBtW)a-Wy h7— 7P?> Klcffly STStl* 



«*SH©**u:«fc3tta**«Tse:itfT 1 ** i o 

©WBB©-r>*- , 7x'rx*1$r?*#*. ±!B©« 
*/«0»W:mBSfc3i7»»«0^ ^ZP ?©X;£ti 

[0 0 4 0] ±>ifi<DJ:dlC, 5£l£©fc4&© 

1 XUtt&<Z>a£«&ffift£Jfc T4b«*R 
fc»LT«*fc:tt4-r* ffit©««fl» 

t«fcl\ LfrLE©*^ ttt::J|iRtfB*r«n« ft* 

[0041] mmcomm^m^mmizmr 

aar rvTRj ©«fe3«:«iia)*7 i iryicB8r*if«& 
tf/xi* laaqriia^-T'^wrsj scbbicbt 

S1*B*«A/?t»J:iA, fllAlfn.— tft*. f^->*a> 

■fey HcajSTSrtiswHHBff-r K^s»3i5«i»*n* 

*S©l«ll*3BJR-r«i:*s rco«H*aHLTT* 

(Please record this movie) j tWt%> 0 -3St), 

^IBRSJCBB-T *flMB*»li)J-r KSBft^b V T R ItiMfi 

(CSHK LfcttSEt LTx B*SJ^fe5 V T R x-^SSU 
<7)VTRx-7 , lC^e>y-r^^fe5o CCDcfed^: 

[0 0 4 2] *y h9-^aH»*W»r«fctt©^TO 
-+rp^> KI!HRBX(0JIQ:«¥iJffiqrieftM8nXtt/ 



[0043] LfctfoT, *»WKJ:*fc, 

X»l*x S^*nfc^7 h7-^mg£$ij$-t3fc#> 
fc» lSi^*nfclL-+f' =1 y'V K£iL-+f =i Kfgfflg 

«»sww:ffiiAtfiL— yiiTv kj» 
»©«sx»*wu «£i5tc*-fts&Hitttc$*£a 

[0044] 

BSE®***] J-X±©J;5lc, h"7- 



t> K©'J>fc< <h& 1 o©ttfrStt*W§*afc<MJ:< 

^x»©sf§£^«ic«ja*ftssisgii©m3§fr5 

fen— «f©SJS=i^v K£\ :i— K»8?£*£ 
5&ETOte3£ti*BaT*a-lf*v h7-^HT> 
F©£T©*Hc«-3^T. »*Sr*a— »f*y h 1 ?-* 

h 7 — * icffiBgtffifcaB*©* h7-^*$fg£»-t 
*i-+f*y f =iv>K*IBfaB.-3«i*«(c« 

[BBB©ffi¥&tti&] 

[0 1 ] *fg^^iiffl Ltcetmrnmom^m^n y 

*B?*S„ 

[0 3 ] mkv>^nmm.<»mi&*7K?yu v * m?& 

So 

1 apgl, 2 W?nfc*s s 3 DSP, 4 C 

pu, s ij>*mm»s 6 I/FWIBW, 7 

^t'A 8 9 DSP, 1 OXfcf-ft, 20 

/U 



[01] 




[E92] 




(5D int. ci. 7 kbiib? 

H0 4Q 9/00 3 3 1 



F I 

G 1 0 L 3/00 



f-7>K (#%) 

5 6 1 H 



K-ryaWftiBH 70736 7i 

/l/A*-y/v>o. h o y hofilW- ->a H 
5— tr 106 V~- -TV*— *-->3* 
;U (a-dv/t) ^-tr/l/->-V7 h 5-y 

hoyhofifrh 7-9Sa*?—k>*- 

(72)RW# ->;U4r zjp>>>- 

Kf yjS£9±ffi[l 70736 7i 

/l//b/\5/ih*yh')fliW- ->ah 

5— b 106 y-- -fy^-^-s/a^- 

;U (3-P-y/\°) $Mf/U-> + 7h 5y 

(72)«W# 5/1/7 

K'T^^ftfPffl 70736 7 1 

;l//fy/\5/a f- * y hOll)\/$- 5/ a h 
7— tr 106 y-- -r>*-ti/3t 
/I/ (3-p<y/\°) 5y 

P*3 



K-rya»ftWH tV 70736 7x 

hoy hoti'il/Z- ->ih 
5— tr 106 y~- ^> 
;U (3-P-y/\°) y4f;U-> + 7h 5v 

hoy hofihh 7^/Pi>'-b^- 

(72)»W# 75 >7 5?n> 

K-T 70736 7 1 

/l/M'yMi/;i h 0 y hOfi'll$- 5/3. h 

=>— tr 106 y~- <>^-^-->3^- 
;l> ^-t?/l/v/-\>7 h 5v 

h ^ii/V^fib j\yyy<f ~y=L 
hoy hofiibh T^/P->'-t^- 

(72)SWJ* 'Vl/A-h- ;Uyfr 

K-pyaaBtani xV — 70736 71 

;l/M'<y/\~>n. hoy hofiiW- h 
5— tr 106 y~- 7-$/ g *- 
;U (3-P7/\°) y-tr/U->*7h =7 
h s<~yjLisy? : rii' /\7>yy<? i/j. 
hoyhofiilh tVSuV— fe>*- 



